Multi-GPU systems and Unified Virtual Memory for scientific applications: The case of the NAS multi-zone parallel benchmarks

نویسندگان

چکیده

• Multi-GPU and Unified Memory implementation of the Multi-Zone NAS Benchmarks. Analysis programmability performance effects Memory. per-GPU allocation have similar programming efforts. Unified-Memory version outperforms manual from 1.1x to 1.85x. GPU-based computing systems become a widely accepted solution for high-performance-computing (HPC) domain. GPUs shown highly competitive performance-per-watt ratios can exploit an astonishing level parallelism. However, exploiting peak such devices is challenge, mainly due combination two essential aspects multi-GPU execution: memory work distribution. determines data mapping GPUs, therefore conditions all distribution schemes communication phases in application. Virtual simplifies codification allocations, but its on depend how used by devices' driver going orchestrate transfers across system. In this paper we present (UM) Parallel Benchmarks which alternate computation offering opportunities overlap these phases. We analyse introduction UM support. Our experience shows that efforts introducing are those having per GPU. On evaluation environment composed 2 x IBM Power9 8335-GTH 4 GPU NVIDIA V100 (Volta), our UM-based parallelization versions 1.10x improvements sensitive information forwarded describing most convenient location specific regions. terms relationship between computational applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Nas Parallel Benchmarks

A new set of benchmarks has been developed for the performance evaluation of highly parallel supercomputers. These benchmarks consist of five parallel kernels and three simulated application benchmarks. Together they mimic the computation and data movement characteristics of large scale computational fluid dynamics (CFD) applications. The principal distinguishing feature of these benchmarks is ...

متن کامل

Characterizing Shared-Memory Applications: A Case Study of the NAS Parallel Benchmarks

The objective of this report is to present our characterization of a shared-memory implementation of the NAS Parallel Benchmarks (NPB). This characterization is needed to support the design decisions of future shared-memory multiprocessors. This report presents two sets of characterization data; the rst set is the application characteristics that do not change from one hardware connguration to ...

متن کامل

Virtual machine workloads: the case for new benchmarks for NAS

Network Attached Storage (NAS) and Virtual Machines (VMs) are widely used in data centers thanks to their manageability, scalability, and ability to consolidate resources. But the shift from physical to virtual clients drastically changes the I/O workloads seen on NAS servers, due to guest file system encapsulation in virtual disk images and the multiplexing of request streams from different VM...

متن کامل

the survey of the virtual higher education in iran and the ways of its development and improvement

این پژوهش با هدف "بررسی وضعیت موجود آموزش عالی مجازی در ایران و راههای توسعه و ارتقای آن " و با روش توصیفی-تحلیلی و پیمایشی صورت پذیرفته است. بررسی اسنادو مدارک موجود در زمینه آموزش مجازی نشان داد تعداد دانشجویان و مقاطع تحصیلی و رشته محل های دوره های الکترونیکی چندان مطلوب نبوده و از نظر کیفی نیز وضعیت شاخص خدمات آموزشی اساتید و وضعیت شبکه اینترنت در محیط آموزش مجازی نامطلوب است.

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Parallel and Distributed Computing

سال: 2021

ISSN: ['1096-0848', '0743-7315']

DOI: https://doi.org/10.1016/j.jpdc.2021.08.001